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[Name of the Document] Specification 
[Title of the Invention] 

METHOD AND APPARATUS FOR EXTRACTED DIGITAL 
WATERMARKING DATA STATISTICAL PROCESSING, AND A PROGRAM 
5 STORAGE MEDIUM 
[Claims ] 

1 . An extracted digital watermarJc data 
statistical processing method for reconstituting the 
digital watermarJc data embedded in data contents, 
10 comprising the step of: 

reconstituting the digital watermarJc data embedded 
in the data contents from a data sequence prior to the 
reconstitution embedded from the data contents by means 
of a detection method on the basis of a binary 
15 distribution in statistics. 



2 . The extracted digital watermarlc data 
statistical processing method as claimed in claim 1, 
further comprising the steps of: 
20 presetting a reliability threshold value of the 

extracted digital watermarlc data; 

calculating an appearance probability of the 
extracted digital watermarlc data prior to the 
reconstitution in the data contents, on the basis of the 
25 digital watermark data obtained from the bias of 
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appearance probability of the digital watermark data in 
the binary distribution of appearance probability of 
each bit of 1 bit sequence extracted at random from 
digital data contents; and 
5 reconstituting digital watermark data by using 

majority decision processing if said appearance 
probability or 1-appearance probability exceeds 
threshold value, and determining that there is no 
watermark or the presence is unknown if said appearance 
10 probability or 1-appearance probability does not exceed 
the threshold value. 



3 . The extracted digital watermark data 
statistical processing method as claimed in claim 2, 

15 further comprising the step of: 

judging from the appearance probability of the extracted 
digital watermark data prior to the reconstitution in 
the data contents in order to reconstitutes the 
extracted digital watermark data. 

20 

4 . The extracted digital watermark data 
statistical processing method as claimed any of the 
claims 1 through 3, further comprising the step of: 

obtaining the digital watermark data and its 
25 reliability from a bias of the appearance probability of 
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the extracted digital watermark data prior to the 
reconstitution in the binary distribution obtained from 
the appearance probability of each bit of 1 bit sequence 
extracted at random from digital data contents of the 
5 appearance probability of the extracted digital 
watermark data prior to the reconstitution. 

5. The extracted digital watermark data 
statistical processing method as claimed any of the 

10 claims 1 through 4, further comprising the steps of: 
modulating in advance the data sequence actually 
embedded as the digital watermark data; and 

demodulating said bit sequence by said pseudo- 
random sequence which is used for embedding the digital 
15 watermark prior to the digital data reconstituting 
processing . 

6. The extracted digital watermark data 
statistical processing method as claimed in the claim 5, 

20 further comprising the step of: 

fixing both of the each bit of the appearance 
probability of each bit of 1 bit sequence extracted at 
random from digital data contents to 1/2. 

25 7. An extracted digital watermark data 
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statistical processing apparatus for reconstituting the 
digital watermark data embedded in data contents , 
comprising : 

means for reconstituting the digital watermark data 
5 embedded in the data contents from a data sequence prior 
to the reconstitution embedded from the data contents by 
means of a detection method on the basis of a binary 
distribution in statistics. 

10 8. The extracted digital watermark data 

statistical processing apparatus as claimed in claim 7, 
wherein said extraction means further comprising 

means for obtaining a binary distribution from a 
digital watermark data length and each bit of the 
15 appearance probability of each bit of 1 bit sequence 
extracted at random from digital data contents; 

means for extracting the digital watermark sequence 
with respect to the each bit of said data contents; 

means for obtaining from the appearance probability 
20 of the extracted digital watermark data; 

means for judging whether the obtained appearance 
probability is larger the reliability threshold value; 
and 

means for reconstituting the digital watermark data 
25 of which the data is judged as larger than said 
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threshold value. 

9 . The extracted digital watermark data 
statistical processing apparatus as claimed in claim 8, 
5 wherein said extraction means further comprising : 
means for obtaining the reliability of said 
reconstituted data bit from said obtained appearance 
probability and outputting together with the 
reconstituted data bit. 

10 

10. The extracted digital watermark data 

statistical processing apparatus as claimed in any of 
the claims 7 to 9 , further comprising: 

means for modulating in advance the data sequence 

15 actually embedded as the digital watermark data by means 
of pseudo-random sequence; and 

means for demodulating said bit sequence by said 
pseudo-random sequence which is used for embedding the 
digital watermark prior to the digital data 

20 reconstituting processing. 

11. An extracted digital watermark data 
statistical processing program storage medium for 
reconstituting the digital watermark data embedded in 
25 data contents, comprising: 
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a computer performance means for the reconstituting 
processing of the digital watermark data embedded in the 
data contents from a data sequence prior to the 
reconstitution embedded from the data contents by means 
5 of a detection method on the basis of a binary 
distribution in statistics . 

12. The extracted digital watermark data 
statistical processing program storage medium as claimed 
10 in claim 11, wherein said program further . comprising : 
processing means for presetting a reliability 
threshold value of the extracted digital watermark data, 
and 

calculating an appearance probability of the 
15 extracted digital watermark data prior to the 

reconstitution in the data contents, on the basis of the 
digital watermark data obtained from the bias of 
appearance probability of the digital watermark data in 
the binary distribution of appearance probability of 
20 each bit of 1 bit sequence extracted at random from 
digital data contents; and 

processing means for reconstituting digital 
watermark data by using majority decision processing if 
said appearance probability or 1-appearance probability 
25 exceeds threshold value, and determining that there is 
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no watermark or the presence is unknown if said 
appearance probability or 1-appearance probability does 
not exceed the threshold value. 

5 13 . The extracted digital watermark data 

statistical processing program storage medium as claimed 
in claim 12, wherein said program further comprising: 
processing means for judging from the appearance 
probability of the extracted digital watermark data 
10 prior to the reconstitution in the data contents in 

order to reconstitutes the extracted digital watermark 
data . 

14. The extracted digital watermark data 
15 statistical processing program storage medium as claimed 
in any of the claims 11 through 13, wherein said program 
further comprising : 

processing means for obtaining the digital 
watermark data and its reliability from a bias of the 
20 appearance probability of the extracted digital 

watermark data prior to the reconstitution in the binary 
distribution obtained from the appearance probability of 
each bit of 1 bit sequence extracted at random from 
digital data contents of the appearance probability of 
25 the extracted digital watermark data prior to the 
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reconstitution . 

15. The extracted digital watermark data 
statistical processing program storage medium as claimed 

5 in any of the claims 11 through 14, wherein said program 
further comprising : 

processing means for modulating in advance the data 
sequence actually embedded as the digital watermark data, 
and demodulating said bit sequence by said pseudo-random 
10 sequence which is used for embedding the digital 

watermark prior to the digital data reconstituting 
processing . 

16. The extracted digital watermark data 

15 statistical processing program storage medium as claimed 
in claims 15, wherein said program further comprising: 

processing means for fixing both of the each bit of 
the appearance probability of each bit of 1 bit sequence 
extracted at random from digital data contents to 1/2. 

20 

[Detailed Description of the Invention] 
[Field of the Invention] 

It is easy to replicate or tamper fraudulently 
with multimedia production, and the easiness hinders an 
25 data content provider from sending data. In addition. 
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some users may not use the data originated from the 
provider validly. Therefore, copyright protection is 
strongly needed for the multimedia production. The 
digital watermarking technique is effective in realizing 
5 the copyright protection. According to the digital 
watermarking technique, sub-data is embedded in data 
contents without being noticed by a user by utilizing 
redundancy of data such as of an image and a sound. The 
digital watermarking technique is used for protecting a 
10 multimedia copyright by embedding copyright information, 
a user ID and the like as the sub-data in secret, since 
it is difficult to separate the sub-data from the data 
contents . 
[Prior Art] 

15 Conventional techniques are proposed in 

Japanese patent applications No. 8-305370, No. 8-338769, 
No. 9-9812, No. 9-14388, No. 9-109924, No. 9-197003, No.9- 
218467 and No. 10-33239. The digital watermark method is 
also called data hiding, finger printing steganography , 

20 image/sound deep encryption and the like. As for a 

digital watermarking system, accuracy for determining 
the presence or absence of embedded data is important. 
In addition, reliability of embedded data is important. 
The digital watermarking system generally has a 

25 mechanism for reconstituting correct digital watermark 
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data even when sub-data embedded in the data contents is 
corrupted to a certain extent, since the digital 
watermarking system assumes various processing on the 
watermarked data contents. However, under present 
5 circumstances, it is impossible for the system to 

evaluate validity of reconstituted digital watermark 
data quantitatively. Therefore, the system does not 
have enough reliability. 
[Object of the Invention] 

10 It is an object of the present invention to 

evaluate quantitatively probabilities of cases that data 
contents which do not contain digital watermark data are 
wrongly judged as containing digital watermark data, and 
incorrect digital watermark data is read from 

15 watermarked digital data contents. 
[Means to Solve the Problems] 

Digital watermark processing is comprised of a 
pair of digital watermark embedding/digital watermark 
extraction. In digital watermark embedding processing, 

20 digital watermark embedding area B^A is selected from 
the digital watermark target area in data contents by 
means of secret key information so that data in area B 
is changed by an inherent rule. In digital watermark 
extraction processing, data of the digital watermark 

25 embedding area B is interpreted and the digital 
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watermark data is reconstituted. The present invention 
judges the probability of occurrence of the digital 
watermark data read from the watermark embedding area B 
by means of the correct secret key data, based on a 
5 binary distribution in statistics of digital watermark 
data read by means of any secret key data regardless of 
true or false from A which is an overall digital 
watermark target area, by means of the digital watermark 
algorism which is to be an applied target of the 
10 invention, in the data contents embedded with digital 
watermark . 
Effect 

The present invention, in an digital watermark 
technique, can evaluate the credibility of the watermark 

15 data read from the data contents, can judge whether the 
data contents have the digital watermark or not, and can 
suppress the probability of reading incorrect digital 
watermark in an certain value from data contents which 
the digital watermark is included. 

20 [Embodiments of the Invention] 
Embodiment 1 

Before explaining embodiments of the present 
invention, definition of some words will be given. 
^^Digital watermark data sequence'' represents a data 

25 sequence read from the digital data contents before 
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being reconstituted. '^^Digital watermark data" represents 
significant data for system operation, which data needs 
to be embedded in the digital data contents, or, data 
obtained by reconstituting the digital watermark 
5 sequence. ''Reliability a of digital watermark" is an 
index representing validity of read digital watermark 
data. That is, it represents a probability that the 
read digital watermark data matches with the actual 
embedded digital watermark data. Conversely, a 
10 probability of reading digital watermark data from an 
image without digital watermark data or reading 
erroneous digital watermark data can be represented as 
2 (l-a) . 

Similarly, ''embedded sequence" represents data 
15 to be actually embedded. The embedded sequence includes 
sequence of embedded data which is modulated, extended 
or repeated. In addition, "read" may be replaced with 
"extract" in some cases. 

Fig.l shows a digital watermarking system of 
20 the present invention. In the system shown in Fig.l, 
digital watermark data 101 is embedded in digital data 
contents 103 by a digital watermark embedding apparatus 
102, then, converted into watermarked digital data 
contents 104. 

25 The watermarked digital data contents 104 are 
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degraded to watermarked digital data contents 105 by 
compression or image processing while the watermarked 
digital data contents 104 are distributed by wireless or 
wire communication or by a packaged medium. 
5 A digital watermark reading apparatus 107 

reads a watermark sequence from the degraded watermarked 
digital data contents 105, and reconstitutes digital 
watermark data 108. 

Fig. 2 is a block diagram of the watermark 

10 reading apparatus 107. The digital watermark data 

reconstitution apparatus 10 8 provided in the watermark 
reading apparatus 107 obtains the probability q that bit 
1 is read when any 1 bit watermark sequence is read from 
a whole watermark area beforehand by using the watermark 

15 reading apparatus 107. 

Specifically, assuming a 1 bit watermark 
sequence reading part 501, the part 501 reads the 
watermark sequence 1 bit by 1 bit from all elements of 
the whole watermark area (a broken line Ll) , and 

20 calculates the ratio of the number of bit 1 to the 
number of all trials. 

In the embodiment, the reading probability of 
bit 1 and the number of bit 1 are obtained. However, it 
is possible that the reading probability of bit 0 and 

25 the number of bit 0 are obtained. Basically, there is 
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no difference between the former and the latter. The 
difference is only on implementation. 

Accordingly, the probabilities of detecting 
bit 0 and 1 when reading 1 bit at random in the 
5 watermark area by using the digital watermarking 

algorithm is calculated to be 1-q and q respectively. 

The n bit watermark sequence reading part 502 
reads the digital watermark data sequence from the 
watermarked digital data contents for the number of 
10 total times of embedding digital watermark data. 

Here, digital watermark data is defined as bo, 
bi, bm-i, bi ^{0, 1}, i<m (m bit length), the 
repeating number of embedding ith bit of the digital 
watermark data in the digital data contents is defined 
15 as n i , the read watermark sequence is defined as 

b'o.Of b'o,l, ".b'o,nO-l, b'i,o, b'i,i, ... b'i,nl-l/... /b'm-l,Or b'm- 
1^1, ... b'm-l,nm-l -1 ^irj 

^{0, 1} (XiTo bit length). 

The data reconstitution apparatus 108 receives 
a subsequence of the digital watermark data sequence one 
20 after another from a subsequence corresponding to 0th 

digital watermark data to a subsequence corresponding to 
(m-l)th digital watermark data (a solid line L2) . 

Next, the method for reconstituting ith bit of 
the digital watermark data will be described concretely. 
25 When ni bits of digital watermark data 
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sequence is read at random from the watermark area, the 
probability P (x=k) of k '1' bits appearing in the ni bit 
sequence is represented by the binary distribution 
density function 
5 P(x=k)=niCkq^- (l-q)""-"^ (1) 

and the distribution function of that, F (x) , is 

F(x)= 2Lo niCkq^- (l-q)""-*^ (O^x^m). (2) 
Here, niCk is the number of combinations when selecting 
k out of n i . 

10 Setting a reliability threshold value ct (l/2<a 

^1) of the digital watermark data, the number of bit 1 
included in a subsequence b'i,0/ b'i,i, ...b'i,ni-i 
corresponding to ith digital watermark data is 
calculated by 

15 ki = 2!!lo b'i,r . 

Then, digital watermark data is determined in the 
following way by using the formula (2) : 

r 0 when O^F(ki)^l-a 

20 bi = I 1 when a^F(ki)^l (3) 

unknown or when l-ct< F(ki)<a 

not present 

Viewing from a different angle, when 
determining by the number of bit 1 included in the 

25 watermark sequencer! j , if the largest integer xo that 
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satisfies O^F{x = xo) and the smallest integer xi 

that satisfies a^F(x = xi) ^1 are assumed to be 
threshold values, the digital watermark data is judged 
as shown- in Fig. 3 such that if the number of 1 in ni is 
5 equal to or smaller than xo , the digital watermark data 
is 0, and that if the number of 1 is equal to or larger 
than xi, the digital watermark data is 1. 

The horizontal axis of Fig. 3 represents the 
number of bit 1 included in the watermark sequence, and 

10 the vertical axis represents frequency of the 

corresponding number. As for unwatermarked digital data 
contents, the frequency that bit 1 appears in a bit 
sequence read at random from the digital data contents 
becomes binary distribution. Thus, the peak of the 

15 frequency is at the half point of the number of bits. 
On the other hand, as for watermarked contents, in the 
subsequence ni in which bit 0 is embedded as digital 
watermark data, the frequency of bit 1 is 0 if there is 
no degradation and it is a small number which is equal 

20 to or smaller than xo even if there is degradation. In 
the subsequence ni in which bit 1 is embedded as digital 
watermark data, the frequency of bit 1 is nl if there is 
no degradation and it is a large number which is equal 
to or larger than xi even if there is degradation. In 

25 this way, the distribution of the frequency of bit 1 or 
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bit 0 in the watermarked sequence is leaning to one side 
from the center of the binary distribution. The present 
invention uses the lean for reconstituting digital 
watermark data from the read watermark sequence. 
5 Depending on a watermarking system, a 

following method can be used. That is, reconstituted 
digital watermark data is obtained by using the bias 
from the central value of the distribution P (x) of the 
watermark sequence extracted from digital data contents 

10 105. Next, the probability of appearing the read 

watermark sequence is calculated by the formula (2) . 
Then, if the reconstituted digital watermark data is 1, 
F(ki) can be added to watermark dada as the reliability, 
and, if the reconstituted digital watermark data is 0, 

15 l-F(ki) can be added. The reliability F (ki) and l-F(ki) 
of the digital watermark data is obtained from the bias 
of appearance probability of the digital watermark data 
in the binary distribution of appearance probability of 
each bit of 1 bit sequence extracted at random from 

20 digital data contents. 

Fig. 4 shows a concept in which the length of 
the digital watermark data is extended to m bits. 

The digital watermark data reconstitution 
apparatus 108 outputs the reconstituted digital 

25 watermark data bo, bi , ... , bm-i as read digital watermark 
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data 10 6. 

Fig. 5 is a flowchart showing the above- 
mentioned process. The process will be described in the 
following with reference to Fig. 5. 
5 Watermarked digital data contents 105 and key 

data which is necessary for reading digital watermark 
data is input, and a digital watermark data sequence is 
extracted with respect to each bit value in step 1 . 
Then, a threshold value a of the reliability is set in 

10 step 2, and a probability q that bit 1 appears when 1 

bit of digital watermark data is read at random from the 
whole watermark area is obtained in step 3. Then, a 
binary distribution function F (x) which represents 
probability that x bits of 1 are included in the bit 

15 sequence is obtained from the probability q and the 
repeating number n j of each bit of digital watermark 
data in step 4 . 

Then, 0 is assigned to i which distinguish a 
subsequence of the digital watermark data sequence in 

20 step 5. Next, the number of bit 1 in the subsequence is 
obtained as ki = 2^r=o ^'i.r and the appearance 
probability F(ki) is obtained, then it is determined 
whether F(ki) is equal to or less than 1-ck in step 6. 
If F(ki)^l-a, the digital watermark data w' i is 

25 reconstituted as 0 in step 7. Then, i is incremented by 
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1 in step 8, and the process goes back to step 6 if i<m 
in step 9. If F(ki)^l-Q; is not true in step 6, it is 
checked whether F(ki)^CK is true in step 10. If F (ki) ^ 
a , the digital watermark data w j is reconstituted as 1 
5 in step 11, and the process goes to step 8. If F(ki)^ct 
is not true in step 10, the process ends by determining 
as there is no watermark or the presence or absence is 
unknown in step 12. If i is more than ni in step 9, a 
reconstituted watermark sequence {w' i} is output. In 

10 the above process, the reading process in step 1 can be 
carried out between step 4 and step 5. In step 6, it is 
checked whether l-F(ki) is more than a. 

In the first embodiment, it is assumed that 
there is no bias in the distribution represented by 

15 formula (1), that is, q=l/2. 

When the embedding number ni of each bit of 
digital watermark data is adequate for obtaining a 
statistical characteristic, it becomes q = 1 / 2 
generally. However, since the value of q depends on 

20 characteristics of an watermarking algorithm and digital 
data contents, q may take a value deviating largely from 
1/2 in some rare cases. A method for solving this 
problem will be described in a second embodiment. 
Second Embodiment 

25 In the following, the second embodiment will 
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be described. Fig. 6 is a block diagram of a 
watermarking system of the second embodiment. 

The watermark embedding apparatus 102 embeds 
digital watermark data 101 in digital data contents 103. 
5 At the time, when embedding each bit value ni times 

repeatedly, watermark sequence is modulated and embedded 
in the digital data contents 103. The modulation is 
carried out by a pseudo-random sequence generator (A) 
501 which is provided in the watermark embedding 
10 apparatus 102. 

For example, when assuming the embedding 
sequence as bo,0/ bo,ir —bcno-ir bi,0/ bi,i, ... bi,ni-i,... ,bm-i,Of 
bm-i,if ."bm-i,nm-i -1 bi,j ^{0, 1}, and the pseudo-random 

sequence as ri^o. ...ri,ni-i bi,j ^{0, 1}, the 

15 embedding sequence is modulated to 
mi,o, rui,!, ...mi,ni-i 

= bi,j ( + ) ri^j 
by the pseudo-random sequence. A(+)B represents XOR of 
A and B . 

20 According to the above-mentioned process, the 

same pseudo-random sequence is necessary for digital 
watermark data reading. 

For example, if 1 bit watermark sequence is 
read by using an M-sequence as the pseudo— random 

25 sequence, it becomes q=l/2. Therefore, the present 
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invention can be applicable without depending on the 
watermarking algorithm and digital data contents . 

When digital watermark data reading, 
demodulation is carried out as b'i,j = mi,j ( + ) ri,j by 
5 using a pseudo-random sequence generator (B) 502 which 
is provided in the watermark reading apparatus 106. 

Here, the pseudo-random sequence generator 
(A) 501 and the pseudo-random sequence generator (B) 
502 needs to be implemented such that both of the 
10 generators generate the same pseudo-random sequence. 

Watermark data is reconstituted with the 
method of the first embodiment from the watermark 
sequence b'o,0/ b'o,i. ...b'o,no-i. b'i,o/ b'i,i, ... b'i,ni- 

1,." ,b'm-l,0/ b'm-1,1/ ".b'm-l,nm-l -1 bi,j ^{0, 1} 

15 obtained by the demodulation. 

Since it is considered that the appearance 
probability q of bit 1 in the watermark sequence can be 
approximated by the binary distribution regardless of 
the presence or absence of modulation, there is no 

20 influence on the distribution of the density function 
(1) due to the modulation shown in this embodiment. 

In addition, q=l/2 can be assumed in 
implementation, that is, no process is necessary for 
obtaining q. Therefore, the amount of processing that 

25 is required for watermark recons titution thus becomes 
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the same as that for majority decision processing. Thus, 
the reconstitution process becomes faster. 
Third Embodiment 

In the following, a third embodiment will be 
5 described. In the third embodiment, an example will be 
described showing concrete values on the basis of the 
first embodiment and the second embodiment. In this 
embodiment, it is assumed that digital watermark data is 
1 bit, the repeating number n of embedding is 127 and 

10 the probability q that bit 1 is read when reading 1 bit 
watermark sequence at random from the whole watermark 
area is 1/2. If the threshold value a is 0.99999 
(which means 99.999%), xo in Fig. 21 is 36 and xi is 90. 
That is to say, according to the present invention, 

15 under the above-mentioned condition, digital watermark 

data is judged as bit 0 if the number of ^1' appeared in 
the watermark sequence {n bits) is equal to or less than 
36, and it is judged as bit 1 if the number of '1' 
appeared in the watermark sequence (n bits) is equal to 

20 or more than 90, and it is judged that there is no 

watermark data or the presence or absence is unknown in 
other cases. If it is judged that there is digital 
watermark data, the correctness of more than 99.999% can 
be ensured. 

25 In the following, examples of experiments will 
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be shown. In the following experiments, an image of 
"lena" which has 128X128 pixels is used as a test image, 
and the threshold value a of the reliability is assumed 
to be 0 . 999999 . 
5 First Experiment 

In this experiment, 1 bit digital watermark 
data '1' was embedded 127 times repeatedly using key 
data '50,000', and the watermark sequence was read with 
various key data. Fig. 7 shows the number of bit '1' in 

10 the read watermark sequence corresponding to the key 

data. In Fig. 7, the vertical axis shows the number of 
bit '1' in the read watermark sequence, and the 
horizontal axis shows the key data value. In this 
experiment, the appearance frequency of bit ^1' in the 

15 watermark area A was q=0. 492247. 

When correct key data (50,000) is used, it is 
judged that digital watermark data is ^1' with 99.9999% 
correctness since the number of bit *1' is more than the 
threshold value for judging the presence of watermark. 

20 When incorrect key data is used, it is judged that there 
is no watermark data or the presence or absence is 
unknown . 

Second Experiment 

In the second experiment, a watermark sequence 
25 which was modulated with a 7 stage M-sequence (initial 
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state is 64) was embedded, and a similar experiment as 
the first experiment was carried out with various key 
data and M-sequences of various initial states. The 
result is shown in Fig. 8. By carrying out the 
5 modulation, the value of q becomes 0.500000 from 
0.492247, and the variance becomes 31.718777 from 
31.008265. Thus, the values are almost not changed from 
those of the first experiment. It is only when correct 
key data and correct pseudo-random sequence are used 

10 that digital watermark data can be read. In addition, 
when the watermark sequence is embedded in half data of 
the watermark area A, q=0. 741547 with the modulation and 
q=0. 499768 without the modulation. 
[Advantages of the Invention] 

15 The effects of the present invention 

corresponding to the second object is as follows. 

(1) There are following effects by judging 
digital watermark data on the basis of the binary 
distribution in statistics : 

20 - The probabilities of following cases can be 

evaluated quantitatively. The cases are: digital data 
contents which do not contain digital watermark data are 
wrongly judged as containing digital watermark data, and 
incorrect digital watermark data is read from 

25 watermarked digital data contents. In addition, the 
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probability can be suppressed within 2(1- a) by using 
the reliability threshold ct of digital watermark data. 

(2) There are following effects by modulating 
digital watermark data by a pseudo-random sequence 
5 before embedding the digital watermark data: 

- The bias of the probability q of reading bit ^1' when 
1 bit watermark sequence is read at random from the 
whole watermark area . 

- It becomes difficult to detect the presence or 

10 absence of watermark data and the value from the bias of 
q without the correct key data and the pseudo-random 
sequence, the key data being necessary for reading 
digital watermark data and the pseudo-random sequence 
being necessary for demodulating read watermark sequence. 

15 - In an implementation, since it can be assumed to be 
q=l/2, the amount of processing that is required for 
watermark reconstitution becomes the same as that for 
majority decision processing. Thus, the speed of the 
processing becomes higher. 

20 a is an index which represents a lower limit 

of the correctness rate of read digital watermark data, 
and is manageable in the digital watermarking system. 
Therefor, the method of using a is superior to a 
conventional method of showing the correctness rate of 

25 read digital watermark data to a user. 
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The present invention becomes more effective 
in combination with an error correction code. That is, 
when a part of bits in digital watermark data is 
intensively corrupted, it is judged that only the part 
5 of bits is unknown and other bit data is in high 

correctness rate. Therefore, correct data can be read 
by correcting only the corrupted bit data. 
[Brief Description of the Drawings] 

Fig.l is a diagram showing an overview of a 
10 digital watermarking system; 

Fig. 2 is a diagram showing an overview of 
digital watermarking extracted apparatus ; 

Fig. 3 is a diagram showing a judgment of 
digital watermarking data; 
15 Fig. 4 is a diagram showing an overview of 

digital watermarking data reconstitution ; 

Fig. 5 is a flowchart showing steps of a 
digital watermark extraction process; 

Fig. 6 is a diagram showing an overview of the 
20 second embodiment of this invention; 

Fig. 7 is a diagram showing result (no 
modulation) of the watermark sequence readings; 

Fig. 8 is a diagram showing result (with 
modulation) of the watermark sequence readings. 
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[Name of the Document] Abstract 
[Abstract] 

[Object] Object of this invention is to quantitatively 
evaluate the probability of reading the incorrect 
5 digital watermark from the data contents which include 
the digital watermarlc. 

[Solution Means] When ni bits of digital 

watermarlc data sequence is read at random from the 
watermark area, the probability P (x=k) of k ^1' bits 
10 appearing in the ni bit sequence is represented by the 
binary distribution density function 

P(x=k)=niCkq''- (1-q)"^"'' (1) 
and the distribution function of that, F (x) , is 
F(x)= Sa=o niCkq^- (1-q)""''' (O^x^m). (2) 
15 Here, niCk is the number of combinations when selecting 
k out of n i . 

Setting a reliability threshold value a(l/2<a 
^1) of the digital watermark data, 
[Selected Figure] Fig. 3 
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